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Abstract. We call a depth-4 formula C set-depth-A if there exists a (unknown) par- 
tition X\ U • ■ • U Xd of the variable indices [n] that the top product layer respects, 
i.e. C(x) = 53 4=1 IIj = i fi,j( x Xj), where fij is a sparse polynomial in F[a;Xj]- Extend- 
ing this definition to any depth - we call a depth-A formula C (consisting of alternating 
layers of E and IT gates, with a E-gate on top) a set-depth-A formula if every IT-layer in 
C respects a (unknown) partition on the variables; if A is even then the product gates 
of the bottom-most IT-layer are allowed to compute arbitrary monomials. 

In this work, we give a hitting-set generator for set-depth-A formulas (over any field) 
with running time polynomial in exp((A 2 log s) A_1 ), where s is the size bound on the in- 
put set-depth-A formula. In other words, we give a gittm-polynomial time blackbox poly- 
nomial identity test for such constant-depth formulas. Previously, the very special case 
of A = 3 (also known as set-multilinear depth-3 circuits) had no known sub-exponential 
time hitting-set generator. This was declared as an open problem by Shpilka & Yehuday- 
off (FnT-TCS 2010); the model being first studied by Nisan & Wigderson (FOCS 1995). 
Our work settles this question, not only for depth-3 but, up to depth elog s/ log log s, for 
a fixed constant e < 1. 

The technique is to investigate depth-A formulas via depth-(A — 1) formulas over a 
Hadamard algebra, after applying a 'shift' on the variables. We propose a new algebraic 
conjecture about the low-support rank- concentration in the latter formulas, and manage 
to prove it in the case of set-depth-A formulas. 



1. Introduction 

Polynomial identity testing (PIT) - the algorithmic question of examining if a given 
arithmetic circuit computes an identically zero polynomial - has received some attention 
in the recent times, primarily due to its close connection to circuit lower bounds. It is 
now known that a complete (blackbox) derandomization of PIT for depth-4 formulas, via 
a particular kind of pseudorandom generators, implies VP ^ VNP (an algebraic analogue 
of the much coveted result: P 7^ NP). It is also known that VP ^ VNP, which amounts 
to proving exponential circuit lower bounds, must necessarily be shown before proving 
P 7^ NP ( |Val79| rSV85j ) . Blackbox identity testing (equivalently, the problem of designing 
hitting-set generators), being a promising approach to proving lower bounds, naturally 
calls for a closer examination. Towards this, some progress has been made in the form of 
polynomial time hitting set generators for the following models: 

• depth-2 formulas [KS01J, 

• depth-3 formulas with bounded top fanin [ASSS12, ISSllj . 

• depth-4 (bounded depth) constant-occur formulas [ASSS12 , 

and a quasi-polynomial time hitting-set generator for 

• multilinear constant-read formulas |AvMVl"l] . 



Key words and phrases, polynomial identity testing, hitting-set, set-multilinear formula, Hadamard 
algebra, shift, low-support rank concentration. 
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among some others (refer to the surveys |SY1CH I5*ax091 |A"B09] ). The hope is, by studying 
these special but interesting models we might develop a deeper understanding of the nature 
of hitting sets and thereby get a clue as to what techniques can be lifted to solve PIT in 
general (i.e. for depth-4 formulas). One such potentially effective technique is the study 
of partial derivatives of formulas. 

Despite the apparent difference between the approaches of [ASSS12] and |AvMVli] , at 
a finer level they share a common ingredient - the use of partial derivatives. The partial 
derivative based method was introduced in the seminal paper by Nisan and Wigderson 
[NW97j for proving circuit lower bounds, and since then it has been successfully applied 
(with more sophistications) to prove various interesting results on lower bounds, identity 
testing and reconstructio n of circuits [ASSS121 lAvMVlll |GKQ12j IGKKS12| (refer to the 
surveys [SY10| ICKW11] for much more). 

Partial derivatives & shifting - the intuition: In a way, partial derivatives shift 
the variables by some amount - for e.g., if f(x±,X2, • • • , x n ) is a multilinear polynomial 
then its partial derivative with respect to x\ is f{x\ + 1, X2, ■ ■ ■ , x n ) — f(xi, . . . , x n ). Out 
of curiosity, one might ask what happens if we shift the polynomial by arbitrary field 
constants? If we shift a monomial /(x) = x\X2 ■ ■ ■ x n by c = (ci, . . . , c n ) G F n , a 7^ 0, 
we get the polynomial /(x + c) = (x\ + c\)(x2 + C2) • • . (x n + c n ). Something interesting 
has happened here: The polynomial f(x + c) has many low-support monomials. By a 
low-support monomial, we mean that the number of variables involved in the monomial 
is less than a predefined small quantity, say I. 

Is it possible that shifting has a similar effect on a more general polynomial /(x), i.e. 
/(x + c) has low-support monomials with nonzero coefficients, if / 7^ 0? Surely, this is 
true if c is chosen randomly from ¥ n (by Schwartz-Zippel [Sch80, Zip79|). But, / is not 
just any arbitrary polynomial, it is a polynomial computed by a formula (say, depth-3 or 
depth-4 formula). This makes it an interesting proposition to investigate the following 
derandomization question: Let / / be a polynomial computed by a formula. Is it 
possible to efficiently compute a small collection of points TcF n , such that there exists 
a c G T for which /(x + c) has a low-support monomial with nonzero coefficient? 

If the answer to the above question is yes, then it is fairly straightforward to do an 
efficient blackbox identity test on /: For the right choice of c G 7", <?(x) = /(x + c) 7^ 
has a low-support monomial. To witness that <?(x) / 0, it suffices to keep a set of t 
variables intact and set the remaining n — £ variables to zero in g; running over all possible 
choices of I variables whom we choose to keep intact, we can witness the fact that g 7^ 0. 
Since I is presumably small, g(x) restricted to I variables is a sparse polynomial which 
can be efficiently tested for nonzeroness in a blackbox fashion [KS01]. 

Indeed, we prove that the above intuition is true for the class of set-depth-A formu- 
las (precisely defined in Section - a highly interesting class capturing many other 
previously studied models (see Section including set-multilinear depth-3 circuits. 

Set-multilinear depth-3 circuits: A circuit C = Yli=i TYj=i fi,j{ x x 3 ■) is called a set- 
multilinear depth-3 circuit if X\ U . . . U is a partition of the variable indices [n] and 
fi,j( x x ) is a linear polynomial in the variables xx i.e. the set of variables corresponding 
to the partition Xj . The set-multilinear depth-3 model, first defined by |NW97] , kicked off 
a flurry of activity. Though innocent-looking, it has led researchers to various arithmetic 
inventions - the partial derivative method for circuit lower bounds |NW97] . noncommu- 
tative whitebox PIT [RSOB], the relationship between tensor-rank and super-polynomial 
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circuit lower bounds [R azlOj . hitting-set for tensors, low-rank recovery of matrices, rank- 
metric codes [FS12J, and reconstruction (or learnability) of circuits [KS06i. Although, an 
exponential lower bound for set- multilinear depth-3 circuits is known [NW97J IRY09] , the 
closely associated problem of efficient blackbox identity testing on this model remained 
an open question, until this work. 

Our contribution: Hitting set for set-depth-A formulas - A whitebox deterministic 
polynomial time identity test for set-depth-A follows from the noncommutative PIT results 
[RS05]. We are interested in blackbox PIT and, naturally, we cannot see inside C and the 
underlying partitions of [n]. The only information we have is the circuit-size bound, s. To 
our knowledge, there was no sub-exponential time hitting-set known for the set-depth-A 
model. Our work improves this situation to quasi-polynomial for any underlying field 
(refer Theorem [I]). We remark that even the very special case of set-multilinear depth-3 
circuits had no sub-exponential hitting-set known (see |SY10| Problem 27]); closest being 
the recent result of [FS12] where they give a quasi-polynomial hitting-set for tensors, 
i.e. the knowledge of the sets X\, . . . , X^ is required. 

Furthermore, set-depth-4 covers other well-studied models - diagonal circuits |Sax08| 
& semi-diagonal circuits [SSS12 - that had whitebox identity tests but no blackbox sub- 
exponential PIT were known. For these (and set-multilinear depth-3), our hitting-set has 
time complexity s°^ ogs \ although, for general set-depth-4 it requires s°^ og s \ 

Depth-4 formulas being the ultimate frontier for PIT (and lower bounds) (AV08J, one 
might wonder about the utility of our result on hitting-set for set-depth-A formulas be- 
yond A = 4. It turns out that there is an interesting connection: We show that a 
quasi-polynomial hitting set generator for set-depth-6 formulas implies a quasi-polynomial 
hitting set generator for depth-3 formulas of the form C = Yli=i TYj=i fiji&XjY*' 3 > where 
XiU. . - UXd defines a partition on [n] and fij are linear polynomials. Since arbitrary pow- 
ers eij > are allowed, the above depth-3 model is stronger than set-multilinear depth-3 
formulas (as there is no restriction of multilinearity) . This appears to be temptingly close 
to the general depth-3 model modulo the partition on variables, and provides us with a 
good motivation to understand the strength of our approach against depth-3 formulas. 

Technical novelty of our approach - As mentioned before, many works have looked 
at the partial derivatives of a formula and related matrices, e.g. the Jacobian [ASSS12, 
BM Sllj . From a geometric viewpoint, the study via derivatives shifts the variables by an 
infinitesimal amount and hopes to discover interesting structure. We take a more radical 
approach; we shift the circuit by formal variables and look at how the circuit changes 
by considering a transfer matrix T. The transfer matrix originates from the study of a 
formula with field coefficients via a simpler one having Hadamard algebra coefficients. This 
makes the transfer process more amenable to an attack using matrices and linear algebra; 
proving properties that are vaguely reminiscent of the case of top-fanin k = 1. 

The main technicality lies in proving the invertibility of a transfer matrix, which is 
an exponential-sized matrix. Some of the arguments here are combinatorial in nature 
involving greedy and binary-search paradigms. 

Although, Hadamard algebra is implicit in the whitebox identity test of [RS05] and the 
study of PIT over commutative algebras of [SSS09] (Theorem 6 in [SSS09] ). the novelty 
of our approach lies in understanding the effect of shift by viewing it through the lens of 
Hadamard algebra, and thereby observing the remarkable phenomenon of low-support rank 
concentration, which in turn implies that a low-support monomial survives after shifting. 

We state our results more precisely now. 
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1.1. Our results. Set-depth & set-height formulas - Let C be an arithmetic formula over 
a field F in n variables x, consisting of alternating layers of addition (S) and multiplication 
(n) gates, with a S-gate on top. The number of layers of LT-gates in C is called the product- 
depth (or simply height) of C and will be denoted by H. Naturally, the depth of C - which 
is the number of layers of gates in C - is either A = 2H or 2H + 1. Counting the LT-layers 
from the top, we label these layers by numbers in the range [H] and will be referring to a 
layer as the h-th H-layer in C, for h £ [H]. 

We say that C is a set-depth-A formula if for every h-th. H-layer in C, there exists 
a partition Xh,i U • • • U Xh,d h of variable indices [n] that the product gates of the h-th 
LT-layer respect. In other words, for every h E [H] the i-th product gate in the h-th 
Il-layer computes a polynomial of the form Ylf=i fi,j( x x h j), where each fij(xx hj ) is a 
set-depth-(A — 2h) formula of height H — h on the variable set xx h ■■ If A = 2H then the 
product gates of the H-th Il-layer are allowed to compute arbitrary monomials, i.e. here 
the .H-th LT-layer need not respect any partition of the variables. 

We will also refer to C as a set-height-H formula. Size of C, denoted by s or |C|, is the 
number of gates (including the input gates) in C. 

Theorem 1 (Main). There is a hitting-set generator for set-height-H formulas, of size s, 
that runs in time polynomial in exp((2i7 2 logs)^ +1 ), over any field ¥. 

Remarks. 1. For blackbox PIT of set-multilinear depth-3 formulas this gives a quasi- 
polynomial time complexity of s ( logs ) - this is the first sub-exponential time algorithm. 
2. For constants H > 1 the formula may not be multilinear, though the hitting-set 
remains quasi-polynomial. The time complexity remains sub- exponential up to H = 
e logs/ log logs, for a fixed constant e < 1 . 

An interesting model that is not set-depth-A but still Theorem [I] could be applied is - 
semi-diagonal formula. The reason being the duality transformation [Sax08, SSS12] that 
helps us view it as a set-depth-4 formula. We recall - a depth-4 (SLTSII) formula C is 
semi-diagonal if, for all i, its i-th (top) product-gate computes a polynomial of the form 
m i ' 11^=1 ftf > w bere mi is a monomial, fij is a sum of univariate polynomials, and b is a 
constant. We give two applications, with similar proofs but, for different looking formulas. 

Corollary 2 (Semi-diagonal depth-4). There is a hitting-set generator for semi-diagonal 
depth-4 formulas, of size s, that runs in time s°( logs ) (assuming char(¥) zero or large). 

Corollary 3 (Set-depth-3 with powers). Consider a depth-3 formula C = X^i=i YYj=i 
fi,j(xx ) £i ' j , where foj is a linear polynomial in ¥[xx], ei,j & N, and X\ U ••• U 
partitions [n] . There is a hitting-set generator for such formulas, of size s, that runs in 
time s°( log s ) (assuming char(¥) zero or large). The result continues to hold even if f%j 
is a sum of univariates. 

Remarks - The restriction on char(F) in the above two corollaries comes from the use of the 
duality trick. We think this restriction can be lifted by using Galois rings ( |Sax08llSSS12| ). 
and defining rank for a Hadamard algebra over a Galois ring appropriately. We avoid 
working out the details here just to keep the focus on the main contributions of this work. 

1.2. Organization. We develop an extensive terminology in Section [2j which would be 
useful later. This section also shows the proof idea at work for the example case of diagonal 
circuits. Section [3] proves the first structural property - a small shift ensures low-block- 
support rank- concentration in a product of polynomials, that have disjoint variables and 
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only low- weight monomials. Starting with this as a base case, Section [4] proves the second 
structural property - a small shift ensures low-support rank-concentration in set-depth-A 
formulas (thus, achieving the presence of a low-support monomial). Finally, the proofs of 
our main results (or hitting-sets) are completed in Section [5] 

2. The basics 

2.1. Polynomials. Let N := Z^o and [n] := {1, . . . , n}. Let R be a commutative ring. In 
the motivating cases R will be a field F, which we implicitly assume to be large enough. 
This we can do as the required field extensions are constructible in deterministic polyno- 
mial time |AL86j . further, as in blackbox PIT we are allowed to evaluate the circuit over 
any 'small' field extension. 

Not always will we use bold notation for a vector, hopefully the context will avoid 
the confusion. For a vector e G Z ra we define |e| := Yli e i- Also, let the support be 
S(e) := {i | e-i / 0} and the weight s(e) be its size. For an exponent vector e G N n , we 
define a coefficient operator Coef (e) : R[x] — > R that on a polynomial / G R[x] equals the 
coefficient of x e in /. Clearly, it is an .R-module homomorphism but is not multiplicative. 
Define the support of f as S(/) := {e G N" Coef(e)(/) / 0} and the sparsity s(f) be its 
size. The monomial-weight of f is n(f) '■= max eg g(j) s(e). Further, define the cone of f as 
S(f) := {e' G N n | 3e G S(/), e' ^ e}, where the inequality is coordinate-wise, and its size 
as s(/). Note that for a sparse polynomial /, s(/) is small but s(f) is usually exponential. 

Lemma 4 (Cone). For an n-variate polynomial f, of degree bound d and monomial- weight 
V,wehave 5 {f)^{ n l l )-{ d+ f). 

For u,v,a G N n define v\ := Uie[n)^ O ■= Uie[n] © = ^hy, and a™ := 
llieW a T~ U ' ■ We keep in mind the conventions: For all a < b G N, (^) = and (q) = 1. 

Lemma 5 (Shift on monomials). Let u, v G N n , a>\, . . . ,a n G R and f = Y\ ie ^(xi + ai) Vi . 
Then, Coef(n)(/) = ( v u ) -a v ~ u . 

For a polynomial / a shift does not change //(/) but, might blow up s(/) exponentially. 

2.2. Hadamard algebras. For a commutative ring R and k G N, we define the Hadamard 
algebra H K (R) := (R K ,+,-k), on the free i?-module R K , by defining: u*v := (ui ■ Ui)ig[ K ], 
where • is the multiplication in R. H K (i?) is an i?-algebra (it is closed, associative, dis- 
tributive and commutative) with the zero vector as zero and the all-one vector as unity. 

We can now naturally define the polynomial ring over H K (i?), H K (i?)[cc]. It inherits 
the operations +,*, and all the elements of H K (i?). Also there is an obvious isomorphism 
between the algebras H K (i?)[:c] and H K (R[x}). (View the elements of H K (i?) and H K (R)[x] 
as 'column vectors' with entries from R and R[x], respectively.) 

For an e G N n and / G H K (i?)[a;], we have the natural notions - coefficient operator 
Coef(e) : R K (R)[x] -)• H K (i?), support S(/) C N n , and sparsity s(/). 

Low-support coefficient- space - For any polynomial / over a Hadamard algebra H K (i?), 
where R is a field, and t G N >0 , define V e \f) := sp R {Coef(e)(/) | e G N n ,s(e) < £} C 
R K (R). We call / £- concentrated over B K (R) if V e (f) = s Pi? {Coef(e)(/) | e G N n }. 

We can extend the above definition also to the case when R is an integral domain, as 
we can then work with the associated field of fractions. 

We demonstrate the usefulness of Hadamard algebra & 'shifting' in achieving low- 
support rank concentration, using the example case of diagonal circuits (see Section |A|) . 
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2.3. Proof ideas. With the spirit of the argument (as in Section [A]) in mind, let us state 
the proof ideas. Let C(x) = Yli=iYYj=i fi,j( x Xj), where fj is a sparse polynomial in 
F[a3x,]> be a set-depth-4 formula. Consider a ITXn formula 

D(x) := fi(x Xl )*--'*fd(xx d ) over H fe (F), 

where the i-th coordinate of fj(xxj) is fi,j{xxj)- Note that C(x) can be expressed as 
(1, 1, . . . , 1) • D(x), where • is the usual matrix product. Denote (1,1,..., 1) by 1. 

For a subspace l^CF^ and polynomials D\, D2 £ Hfe(F)[aj], we say D\ = D2 (mod V) 
if each coefficient of D\ — D2 is in V. Somewhat wishfully, we would like to propose a 
low-support rank- concentration property: 

Conjecture 6 (Wishful!). If I > \og\D\ then D(x) = (mod Vg(D)). 

If this is true then the coefficient of x e , in D, is in the F-span of those coefficients that 
correspond to low support, i.e. 0(log \D\). Suppose we verify the zeroness of its o C(x) = 
1 • D(irsx), for S G (J_i) an£ i n s '■ x i ^ ( x i if * € S, else 0). This means that Ve £ N n 
with s(e) < I we have 1 • Coef (e)(D) = 0. Now the conjecture implies that also Ve G N n 
with s(e) I we have 1 • Coef(e)(Z?) = 0, clearly implying, C(x) = 1 • -D(tc) = 0. In other 
words, we have a blackbox PIT for set-depth-4 in time poly(n log ' c ''). 

Unfortunately, Conjecture [6] is easily false! For example, let D(x) = x\ ■ ■ ■ x n and 
1 < £ < n. Then obviously D(x) ^ (mod V e (D)). 

Here is where 'shifting' enters the picture. The goal in this paper is to prove that after a 
'small' shift of the variables, D begins to satisfy something like Conjecture]!)} This requires 
a rather elaborate study of how a formula changes when shifted; the meat is expressed 
through certain transfer equations. Looking ahead, we conjecture (without proof) that 
the phenomena continue to hold in general constant-depth formulas. 

2.4. Set-height formulas over Hadamard algebra. Just as we have defined set-height 
formulas over a field F - meaning, the underlying constants come from F, we can also define 
set-height formula in a natural way over any Hadamard algebra H K (i?). The reason we 
can extend the definition to arbitrary H K (i?) is that the defining property of set-height 
formulas is the existence of a partition of variables for every n-layer (irrespective of where 
the constants of the formula come from). Size of a formula C over H K (ii) is defined as k 
times the number of gates in C. 

Let C be a set-height-ff formula (over F) of depth A - we will count depth of C from 
the top, i.e. the top S-gate is at depth 1. If A is even (resp. odd) then the gates of 
the bottom- most S-layer compute sparse polynomials (resp. linear polynomials) in the 
variables. Let k be the maximum among the fanin of the S-gates of C (barring the gates 
of the bottom-most S-layer), and d the maximum among the fanin of the n-gates in C. 

Uniform fanin of S and H-gates - With the definitions of k and d as above, we can assume 
that the fanin of every E-gate in C (barring the gates of the bottom- most S-layer) is k, 
and fanin of every n-gate is d. This can be achieved by introducing 'dummy' gates: The 
'dummy' S-gates introduced as children of a n-gate compute the field constant 1, and the 
'dummy' n-gates introduced as children of a S-gate also compute 1 except that some of 
the field constants on the wires are set to zeroes. This process keeps C a set-height-i? 
formula but might bloat up the size from s to s A , although it does not change k and d 
(according to the way we have defined them). Of course, formula C is not modified physi- 
cally as it is presented as a blackbox. But the point is, even in the blackbox setting we can 
treat C as a set-height--ff formula with uniform fanin of E and n-gates. We will call this 
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uniform fanin of the S and LT-gates as the "S-fanin and U-fanin, respectively. Note that the 
definition of S-fanin excludes the gates of the bottom-most X-layer - they are handled next. 

Fanin bound on bottom-most T,-gates - If A is even, denote the set of monomials computed 
by the H-th LT-layer by M; if A is odd then M := x U {1}. The fanin of every gate of the 
bottom-most S-layer is bounded by A := \M\ + 1. Refer to A as the sparsity parameter. 

Henceforth, we will assume uniform E and II- fanin of C (k and d respectively), keeping 
in mind that the fanin of every gate of the bottom- most S-layer is bounded by A. All of 
k, d and A are in turn bounded by s. Denote this class of formulas over F by Co(k, d, A, x). 

Recursive structure of set-height formulas over Hadamard algebras - Let Ch(k,d, X,x) be 
the class of set-height- (H — h) formulas, of depth (A — 2h), in the variables x with S-fanin 
k, II- fanin d and sparsity parameter A, over the Hadamard algebra TZh ■= H fc h(F). (Eg., 
to begin with h = and the input formula C G Co(k, d, A, x).) Assume that k, d and A are 
less than s, which is the size of the input formula C. Let Ch be a formula in Ch(k, d, A, x). 

(1) C fc (aO = J>- [J fijfrxj), 

Ci G TZh, fij(x Xj ) is a set-height- (H — h — 1) formula over TZh on the variables x Xj , 
and X i U • • • U is the partition of [n] that the first n-layer of Ch(x) respects. Let 
TZ h+ i := B. k (TZ h ) = H fc ; I+ i(F). Define fj{x Xj ) ■= {h,j{x Xj ), ■ ■ . , f k ,j{x Xj )) T G Kh+i[x X j]- 
Let 

D h (x) := fi(x Xl )*---*fd(xx d ) = ]J fj{x Xj ) oveiTZ h+ i, 

where * denotes the Hadamard product in the algebra TZh+i (extended naturally to the 
polynomial ring over TZh + \). Evidently, 

(2) C h (x) = (ci, . . . , Cfe ) • D h (x) = c T ■ D h (x), 

where • is the product for matrices over TZh [x] ■ We intend to understand the nature of the 
circuit Ch(x) by studying the properties of the circuit Dh(x) - it is here that the recursive 
structure reveals itself as in Lemma[7j Let Vh(h') := {Xh\i, ■ ■ ■ , Xh\d} be the partition of 
[n] that the h'-ih n-layer of Ch respects. (Recall that when the depth of Ch is even then the 
bottom-most n-layer need not respect any partition - this attribute would always remain 
implicit in our discussions.) Define the partition Vh^h' , Xj) := {Xh> i HX-, . . . , Xh' .d^Xj} 
(ignore here the empty sets), for every 1 < j < d. 

Lemma 7. For every j G [d], fj{x Xj ) is a set-height- (H — h — 1) formula in TZh+i[x Xj ] 
with Yi-fanin k, H-fanin d and sparsity parameter X, i.e. fj(x Xj ) G Ch+i(k, d, A, x Xj ), such 
that every h'-th H-layer of fj{x Xj ) respects the partition Vh(h + l,Xj). (Pi. in App. [B| 

2.5. Matrices. A matrix M with coefficients in ring R, and the rows (resp. columns) 
indexed by 1 (resp. J) is compactly denoted as: M G (X x J — > R). M is simultaneously 
a map from X X J to R, and a R-linear transformation from R\^\ to R^. When R is an 
integral domain, we denote the rank by ik^M. Note that the row-rank and column-rank 
are equal for a matrix. We call a matrix M G (X x J — > R), |X| = |j7~| — 1, strongly 
full if for all u 6 J, Mi,j\{ u } 1S invertible. For two matrices M\,M2 and a i?-module 
V, we write M\ = M2 (mod V) to mean that each column of Mi — M2 is in V . For 
two matrices M\ G (Xi x J\ R) and M2 G (X2 x J2 — ^ R), the matrices Mf (when 
|Xi| = |Ji|), MiM 2 (when J x = X 2 ) and M x ® M 2 are in (J x xl^ R), (Xi x J 2 ->• R) 
and ((Xi x Z 2 ) x (Ji x J 2 ) — > R) respectively. For a matrix M G R KXa and an element 
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v G H K (R), v -k M is the matrix obtained after taking the Hadamard product of each 
column with v. For two matrices Mi G (X x J\ — > R) , M2 € (X x J 2 — )• i?) the Hadamard- 
tensor matrix Mi © M2 £ (X x {J\ x ^2) — ^ R) is defined as: Its (ji, j2)-th column is 
(Mi)xjj * (M2)xj 2 . We list some intuitive formulas. 

Lemma 8 (Matrices). For any column-vector v and matrices Ei, Mi, Zi, with suitable 
assumptions on the sizes and invertibility, we have: 

(1) ■ (®»Afi) = ®i(EiMi). 

(2) ©.M" 1 = (0,-Mi)- 1 . 

(3) (u*Afi)-M 2 = t;*(AfiAfa). 

(4) (ZiAfi) © (Z 2 M 2 ) = (Zi © Z 2 ) ■ (Mi © M 2 ). 

3. Low-block-support rank-concentration 

For % G [£], let /j G H K (F)[a?xJ be a polynomial of degree at most 5, where the Xj's are 
disjoint subsets of [n]. Define [i := maxj{/u(/j)}. By Lemma [4j the sparsity parameter 
A := max; {s(/i)} of the tfa is bounded by (5 + n + Define £ := 2 [log 2 k] + 1. 

Consider the depth-3 (LTSn) formula over H K (F), 

D := fi(x Xl )* fe(x Xt ) m H re (F) [x]. 

We shift it by formal variables t to get -D(cc + t) = fi(x Xl + txj * • ■ • * fi{x Xl + ijsQ) 
in H K (F[t])[x]. Wlog we can assume that, Vi G [£], is a unit in H K (F(i)). This is 

because not being a unit only means that the vector /, G F(t) K has a zero coordinate, 
say at place j G [ft]. Then the j-th coordinate of D(t) is zero, and we can forget this 
position altogether; project the setting to the simpler algebra H re _i(F). We normalize fi 
tofi(x) := frity^fiix + t). Define D'(x) := f[(x Xl ) * ■ ■ ■ * f e (x Xi ) in R K (¥(t))[x\. ' 

(3) D{x + t) = D(t)*L/(x). 

Any exponent e G N n , possibly appearing in D', can be written uniquely as e = X^eM e *' 
where e, G S(fi), because fa's are on disjoint set of variables. We will frequently use this 
identification. We define the block-support of e, bS(e) := {i G [l]\ei 7^ 0}, and let the 
block-weight bs(e) be its size. Based on this we define a relevant vector space, for I G N>o, 

Vi(D') := SPF(t) {Coef(e)(l/)|eGN n ,bs(e) < 1} . 

Ordering & Kronecker-based map - We define a term ordering on the monomials t e , e G N n , 
and their inverses. For a to G N n we denote the ordering as t e ^ w t e , or equivalently 
l/^ e z^™ if SisW ^ Siefn] w i e 'i- Note that the ordering is multiplicative on the 
monomials, equivalently, the induced ordering on the exponents is additive. 

For reasons of efficiency, useful later but skippable for now, we assume: -< w keeps 

the monomials |lJie[£] ^ I ^ e Mi e i £ ^(/i)} distinct. If we fix such a u> G N>o (note: 

it could be found in time \°^'), then the Kronecker-like homomorphism r : 1— >• 
(Vi G [n]) will obviously also map the aforementioned monomials to distinct univariate 
ones. We extend r to a homomorphism from H K (F[£])[a?] to H K (F[y])[a;], by keeping x 
unchanged. Its domain can be further extended to a subset of H K (F(t))[ac] (i.e. as long as 
r does not cause a division by zero). 

We would like to prove something like Conjecture [6] for D(x + i). Note that it suffices 
to focus on D'(x) as its coefficients are all scaled-up by the same nonzero 'constant' D{t). 
The rest of the section is devoted to proving the following theorem. 
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Theorem 9 (Low block-support suffices). D'(x) = (mod Vi(D')). Further, it remains 
true under the map r . 

3.1. Shift-&>normalizing D. We investigate the effect of shift-&-normalizing on /j. 

Write, for i G [£], =: £„. 

eS(fi) z i,Vi xV '- (Note: t>j G N™ and we will denote its 
j-th coordinate by Vij G N.) This yields, after shift-&-normalize (division by units is 
allowed in H K (F(i))), ' 

fKx):=fi(x + t)/fi(t)=: Yl z 'i^ Ui e H K (¥(t Xi ))[x Xi }. 
The last step defines 

(4) 4, Ui = Coaf («,)(/[) = my 1 * £ ^(r)^"" 1 

for all exponent vectors itj G S(f-) C <S(/|) = S(fi). The constant coefficient of /•, ^ = 1. 

3.2. Transfer equation of a single polynomial. Let / be one of the polynomials 
h,...,fi over H K (F). Let S := S{f) and S := S(f). For v G 5 define z v := Coef (u)(/), 
and := Coef (u )(/')• Since / is a unit, obviously, S ^ and 5 7^ 0. Let Z G ([k] x5 — > F) 
be such that: Its v-th column is the vector z v . Note that exactly s(/) of these columns 
are nonzero. Let Z' G ([k] X S — >■ F(t)) be such that: Its u-th column is the vector z' u . 
For any C C <S(/) we define a diagonal matrix Nq G (C x C — > F[t]) as: Its it-th diagonal 
element is t u . Let the transfer matrix (of XII formulas) T G (5 x 5 ->■ F) be such that: 
Its (t>,u)-th entry is Q). We are ready to state the promised transfer equation. 

Lemma 10 (Transfer equation - primal). Z' = f'{t)~ l * ZNgTNg 1 . (Pf. in Appendix [c|) 

For later use, we need a 'modulo' version of this transfer equation. As shorthand denote 
^[k]C ^ ^c> f° r an y C C 5. Note that the transfer matrix captures a transformation, 
from Z to Z' , which is clearly invertible. Thus, T is an invertible matrix. Define T" : = 
(Ts^)" 1 G (5 x S -> F) and 5* := 5 \ {0}. If 5* = then it only means that / G H K (F), 
and is invertible. Such an / could be dropped from D right in the beginning. From now 
on we assume S* 7^ 0. We deduce a modulo version now. 

Lemma 11 (Transfer equation - mod). We have f(t)~ 1 -kZ = Z' s ,Ns*Tg* gN^ 1 (mod z' ). 
Further, T^* s is strongly full. (Pf. in Appendix [C| 

3.3. Transfer equation of D: Hadamard tensoring. For two subsets B\,B2 C N n 
we define B\ + B2 := {61 + 62 I &i S B\,b% G -B2}, where the sum is coordinate- wise. For 
* G let 5» := S(fi) and 5* := 5j \ {0}. Define S := £i 6 [/| $ and 5' := £ ie 0«S*. 
Note that there is a natural identification between 5' and x i€ ^S*. We will be implicitly 
using this. For i G define Zj G ([«] X Si — >• F) such that: Its Uj-th column is the 
vector := Coef (nj)(/j). Let Z G ([k] x S — > F) such that: Its u-th column is the 
vector Zu := Coef (u)(D). Note that Z = © ie[£ ]Z- For i G [£], define Z- G ([«] X 5* -> F) 
such that: Its fj-th column is the vector z' iv . := Coef(vi)(f-). (Note that Z- has fewer 
columns than Zj.) Let Z' G ([«] x 5' — >• F) such that: Its u-th column is the vector 
2^ := Coef(w)(I? / ). Note that Z' = ®i^\£\Z' i . For any C C 5 we define a diagonal matrix 
Afc G (C x C -)• F[t]) as: Its u-th diagonal element is t u . For i G [£}, define T/ := T£* 5 .. 

Let the transfer matrix (of LTSII formulas) T' G (5' x S — > F) be <8>j e ^T/. 

Lemma 12 (Tf. eqn. depth-3). D{t)~ x *Z = Z'N S 'T'Ng l (mod V e (D')). (Pf. App. [cj 
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3.4. Combinatorial juggernaut: To select columns of T'. Recall that T' has rows 
(resp. columns) indexed by S' (resp. S) and has entries in F. Let Ai be some k > 
columns that we intend to remove from T'\ we call them marked and the others S\Ai are 
unmarked. We make the following claim about the submatrices of T" not involving Ai. 

Theorem 13 (Invertible minor). There exist unmarked columns C C S, \C\ = \S'\, such 
that \T' S i c \ ^ 0. (Proof in Appendix |cj) 

3.5. T 1 on the nullspace of Z: Finishing Theorem [9[ Recall that the columns of 
Z are indexed by S. Think of these ordered by the weight vector w, as discussed in the 
beginning of this section. Pick a basis Ai, size at most k, of the column vectors of Z by 
starting from the largest column. Formally, Ai gives the unique (once -< is fixed) basis 
such that for each u-th, u G S \M, column of Z there exist columns u\ , . . . , u r G Ai 
spanning the u-th column, and u -< u r -< ■ ■ ■ -< u\. We think of the columns Ai of T' 



marked, and invoke Theorem 13 to get the C C S. We define an A G (5 x C — > ¥): 
If a is the u-th column of A then Z ■ a = expresses the F-linear dependence of z v on 
{z v i | v' E Ai, v -< v'}; in particular, the least row where a is nonzero is the u-th, the entry 



being 1. Recall the transfer equation, Lemma 12 for the following. 

Lemma 14 (T" on nullspace of Z). |T'A r ^ 1 A| ^ 0. Further, the leading nonzero inverse- 
monomial in the determinant has the coefficient \T^, c \. (Proof in Appendix |Cj) 

Finally, we use A to finish the proof of our main structure theorem. 

Proof of Theorem [S| From the transfer equation, Lemma |j~2| we recall 

D(t)- 1 * Z = Z'N s ,T'N s l (mod V e (D')). 

Right-multiplying by A, we get 

(5) = Dity 1 * (ZA) = Z'N S >T'N S 1 A (mod Vt(D')). 

and Ns> is obviously invertible, we get 



Since T'N S 1 A is invertible from Lemma 
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Z' = (mod V t (D')). 

(Here we do use that the matrices are over F(t) and that Vi(D') is an F(t)-vector space.) 
This immediately implies the first part of Theorem [9| as Z' collected exactly those coef- 
ficients of D 1 that we a priori did not know in Ve(D'). The second part of the theorem 
follows easily as: (1) r keeps D{t) a unit, and (2) r corresponds to the correct term 
ordering < w . These two properties allow the above proof also work after applying r. □ 



4. Low-support rank-concentration 

We will prove that a set-height-i? formula, after a 'small' shift, begins to have 'low'- 
support rank-concentration. The proof is by induction on the height of the formulas over 
Hadamard algebras. For this, we would need the following concepts. 

For H > h G N, let t^ := {tff-i, ■ ■ ■ ,th+i>th} De a se t of formal variables and F(tft) 
be the function field. These t^-variables are different from the variables x involved in the 
formula C. Let lZ' h := H fe h(F(t/ l )) be a Hadamard algebra over F(t^); k h = dim^f th \TZ' h . 
Further, 1Z' h+l [th] denotes the (univariate) polynomial ring over 7l' h+1 , and lZ' h+1 (th) is 
the corresponding ring of fractions. (lZ' h+1 (th) is basically H fc h+i(F(t/,)).) 

Low-support shift for Ch(k, d, A, x) - Let be a map from ¥[x] to F(t/ l )[ic] defined as, 

i ,0-H-lA | , j. a h,i r _ _ 

T h : Xi H> Xi + a H -i,i t H _ x H h ah,i t ft , for Xi G 
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a H -i,i, • • • , a hji G Z+ and a H -i,i, • • • , &h,i G F. (r h fixes F, i.e. T h (c) = c for c G F.) In 
short, we will write Th ■ x \- > x + ah t^ h . For £h G N, the map Th (as above) is called an £h- 
support shift for the class of formulas Ch(k, d, A, x) if for every formula Ch G C/x(&> d, A, 
the polynomial Th(Ch(x)) = Ch(x + a^t^ 1 ) is ^-concentrated over 1Z' h . 
For the rest of our discussion, we will fix £h as follows, for H > h > 0: 



4 := 



(2i/[i/log 2 A:l) H - /l - 1 • 2 [iilog 2 (£;A)l + 1, if A is even, 

(2H\Hlog 2 k]) H ~ h + 1, if A is odd (& for h = H,£ H := 2). 



The above setting satisfies the relation 4 = — l)H(t— 1)+1, where £ := 2|\ff log 2 fe] + 
1, for every if — 1 > h > (and also for h = H — 1 when A is odd). 

Recall Equation[2]that says - for each h G {0, . . . , iT— 1} and Ch, there exists c G Hfc(7£h) 
such that Ch = c T ■ Dh- This section is dedicated to proving the following theorem. 

Theorem 15 (Low support suffices). We can construct tq such that tq o Dq is £q- 
concentrated over 7*^ [to], in time polynomial in (d + n + £o) , where n := \x\. 

Proof strategy ahead - The idea is to construct the map Th by applying induction on height 
H — h of the class Ch(k, d, A, x). By Equation [2j 

C h {x) = c T ■ (h(x Xl )*--*fd(xx d ))- 

FromLemma|7j fj{x Xj ) G Ch+i(k, d, A, x Xj )- By definition, r h+ i : Xj i-> Xi+aH-i,i ^'1'+ 

h aft+i,* t^Y' 1 is an -support shift for C/i+i(fc, d, A, xjf 3 ) for every 1 < j < d. Here 

is where we use induction on height H — h: We will build the map from the induc- 
tive knowledge of Th+i- Basically, we will show that it is possible to efficiently compute 
Oft.u • • • > a h,n G ^ + and a M , . . . , a hjn G F such that r h : (->■ r ft+1 (xi) + a h>i t h h ' 1 is an 
^-support shift for Ch(k, d, A, x). 



The proof of Theorem 15. The proof proceeds by induction on height H — h of the 
class Ch(k,d, X,x) (in other words, reverse induction on h). The induction hypothesis 
is that Th+i, an 4+i-support shift for the class Ch+i{k, d, A, £c), can be constructed in 
time polynomial in (d + n + 4+l) +1 ) where n := |a;|. Overall this means, by varying 
h G [0, 7J— 1], we get a hitting-set of size polynomial in TL h ~} (d + n + lhY h ^ (d + n + 
£ Q ^T.h e h < (^ _|_ n _|_ £ ) 2 ^o i \Ye discuss the base case and the inductive step in separate 
detail. Keep in mind that fj{xxj) G Ch+i(k, d, A, xx 3 )- 

4.1. Base case (h + 1 ^ H — 1). The base case is when H — h — 1 = 1 or 0, i.e. fj(xx )'s 
are sparse polynomials or linear polynomials over IZh+i, depending on whether A is even 
or odd, respectively. These two base cases have varying level of difficulty. If if — h — 1 = 
then lh+i — = 2, hence taking th as the identity map suffices (since fj(xx J ) , s are linear 
polynomials) as an ^-support shift for the class Cn(k,d, X,x). If if — h— 1 = 1 then 
fj(xx J ) , s are sparse polynomials. We first prove an, independently interesting, property. 

Lemma 16 (Sparse polynomial). Let f G H K (F)[a?] be a polynomial with degree bound 5. 
Let £':=! + min{2 [log 2 (« • s(/))] ,//(/)}. We can construct a map a : Xi h-> X{ + t bi , in 
time polynomial in (5 + n + £'Y , such that o~(f) is £' -concentrated over H K (F(t)). (Ap. [Dj 



Now we apply the lemma to the sparse polynomial fj(xxj), which has the sparsity 
parameter A. Hence we define r^+i = th-i '■ Xi 1— > Xi~\-t b ^_ l (in other words, an-i,i ■= h). 



This, by Lemma 16, ensures that the concentration parameter is 2 (log 2 (A; 1 • A)] + 1 ^ 
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2 \Hlog 2 (k\)~\ + 1 = £h-i = t-h+i- Finally, th-i is an ^#_i-support shift for the class 
CH-i(k, d, A, x), and it can be constructed in time polynomial in (d + n + £h-i) 1 ■ 

4.2. Induction (h + 1 to h). Let fj(x Xj ) '■= Th+i{fj(x Xj ))- Then, 

D h (x) := T h+ i{D h (x)) = fi(x Xl )*---*f d (x Xd ), 

where every fj is 4+i-concentrated over TZ' h+1 (by induction hypothesis). Let t := 
{th,i, ■ ■ ■ ,th,n} be a set of 'fresh' formal variables. (We will keep in mind that the t- 
variables would be eventually set as univariates in a variable t^.) As before in Eqn. |3j 

D h (x + t) = H f 3 (x Xj +t X] ) = \\ f j (t Xj )*ffa Xj )=D h (t)*D' h (x). 

In the same spirit as Theorem [9J we would like to show that D' h (x) = (mod Ve(D' h )), 
where V e (D' h ) := sp F(th+i>t) jCoef (e)^) | e G N",bs(e) < ij, and £ = 2r#log 2 k] + 1. 
As before (see 'key argument' in Lemma [To] ), it is sufficient to prove the typical case 
(i.e. product of the first £ polynomials), D' h e (x) := Y\je[e] fj(. x Xj) = (mod Vf.(D' he )) To- 
wards this, we define the truncated polynomials, gj(x Xj ) := J2 e -s(e)<e h+1 Coef (e)(/j) x x 
and let the corresponding product be Eh(x) := ITje[<f] 9j( x x 3 •)• Sparsity of gj(x Xj ) over 
is bounded by {d H ~ h ~ l + n + 4+i) £h+1 =: A/j. Mimicking the notations on let, 

sa(35 + *) = n + ^ = * and - n %^)- 

By Theorem [9J we can find a^i, ■ ■ ■ o,h,n 

G Z + in time {d\ h )°^ = (d + n + £ h )°^ such 
that by setting t^i = ah,it^ h ' 1 (any 07^ G F \ {0} works), where th is a 'fresh' formal 
variable, we can ensure that the following is satisfied: 

(6) Eh4(.x) = (modV e (E' h/ )). 

The claim is that the same setting th,i = oth^t^' 1 (now with carefully chosen cth/s) also 
ensures that D' he (x) = (mod Vi(D' h( )). Consequently, D' h is (£ - l)(4+i - L) + 1 < 4 
concentrated over TZ' h+1 (th)- This is what we argue next. Equation K3| implies 
(7) 

E h/ (x+at) = Y[ 9j{x x . + ol Xj t Xj ) = E hti (ai)-kE' hji {x) = (mod Vi(E h/ (x+at))), 

where (reusing symbol) t := (t^' 1 , • • • >*//''"') and a: := (a^ i, . . . , a^„). Define, Dh,l( x ) '■ = 
Tlj=i fj( x Xj)- We need to take a closer look at how the coefficients of Dh/(x), Dh,e(x + 
at), Eh£(x) and Eh t i(x + at) are related to each other. Towards this, define: 

Zj,u s ■= Coef(«^)(2(aJjfj)) G ^A+i, 

% Uj - : = Coef^)^^ + a Xj t x .)) G ^ +1 [t ft ], 

5j>j := Coef(uj-)(gj-(a;Xj)) G Tl' h+1 ; equals if Uj G 

: = Coef(u j )(g j (x Xj + OL X .t x . )) G 
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Let, 

Bj := {uj : Zj jU j 1S i n the F(t/ l +i)-basis of the coefficients of fj} and 

Bj := {uj : Zj jUj is in the F(t/ l+ i) -basis of the coefficients of Tjj} 

with respect to some fixed basis that comprises coefficients of monomials of as low support 
as possible. Note that Bj = Bj =: Bj, as fj is £/j + i-concentrated over TZ' h+l . 

The crucial observation is that, for any Vj G Bj, z'j v . gets a i^-free contribution only 
from the monomial x Vj , thus, its basis representation looks like: 



= C 1 + a ( v j > V J )) • Hvj + Yj a( u j,Vj)- Zj,u 3 ■ 



where a's are in F(t^ +1 )[^] and th divides each a{-,Vj). Similarly, 



:', „ - (f + b(vj,Vj)) ■ Zj jV . + b(uj,Vj ) • : , 



j,Vj — V A t W \ U V u 3l) ' "Wj '• '"' "./• ' J > -J- 11 1 ' 



with u,-th column z 1 , ,, , 
with Uo-th column 



where 6's are in F(t/ l+1 )[t/ l ] and th divides each b(-,Vj). Now define the following matrices: 
Zj G ([k h+1 ] x Bj -> ¥(t h+ i)) ; with Uj-ih column zj tUj , 

z;e([fc w ]xB^F(t„)) 

From the above crucial observation, 

(8) Z'j = Zj • M' and Z'j = % • M', 

where M',M' G x £>.,■ — )■ F(t/ l+ i)[t/ l ]) with rows indexed by itj G Bj and columns 
indexed by Vj G -Bj. The (uf, Vj)-th entry of M' contains a(v,j,Vj) if Uj ^ Vj, otherwise 
f + a(uj, Vj) if Uj = Vj. Similarly, the (uj,Vj)-th entry of M' contains b(uj, Vj) if Uj ^ Vj, 
otherwise l+b(uj,Vj) if Uj = Vj. Note that both M' and M' are invertible over ¥(th+i)(th) 
as det(M') = det(M') = 1 (mod t h ). Therefore, 

(9) Z'j = Z'j ■ (M'~ l M') and Z] = Z] ■ {M'^M'y 1 . 

Now observe that any coefficient of Dht(x + at) is an F(t/ l )-linear combination of the 
columns of ®j^[g\Zj (by the definition of Bj), which by Equation [8] (& Lemma [sj- (4) ) is 
an F(t/ l )-linear combination of the columns of ®j£[i]Zj - this in turn is an F(t^)-lmear 
combination of the columns of ®j^Z'- (by Equation 9). By Equation[?J any F(t/ l )-linear 
combination of the columns of ®je[e\Z'j can be expressed as an F(t/ l )-linear combination 
of those columns u of ®j^[i]Z'j for which bs(u) < I, which in turn can be expressed as an 
F(t/j)-linear combination of those columns u of f° r which bs(u) < I (by Equation^ 

again). In other words, we have shown the following: D^^x + ott) = (mod Vi(Df l /(x + 
at))). This would imply that D' he (x) = (mod Vi{D' he )), if we choose a so that the 
map th,i i-> ah^ h h ' % ensures that fj{a.x :j ^x j ) 1 is well- defined in lZ' h+1 (th)- Such an a 



can be constructed, by Lemma 17 in time polynomial in = (d H ~ h ~ l + n + £h+xY h+1 - 
Therefore, Th : Xi \-t Th+i{xi) + oth,it h ' l is such that Th(Dh(x)) is ^-concentrated over 
7l' h+1 [th\- Since Ch{x) = c T ■ Dh(x), hence Th(Ch{x)) is ^-concentrated over lZ' h . This 
finishes the construction of Th, given Th+i, in time (d + n + £h)°^ h - D 
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Lemma 17 (Preserve invertibility) . Let f £ H K (F)[cc] be a polynomial with degree bound 
5. Assume that f is £' -concentrated over H K (F), and that f~ l £ H K (F(a;)). Then, we can 
contruct an a £ ¥ n , in time polynomial in k(5 + n + t'Y' , such that /(a) -1 £ H re (F). 

(Proof in Appendix [Dj) 

5. Reading off the hitting-set 

5.1. Proof of Theorem [TJ Suppose we are given a blackbox access to a set-height- H 
nonzero formula C of size s, more so we can think of C = Co £ Co(k,d,\,x). Using 
Theorem 15 we can construct a map to : ¥[x] *— > ¥[to][x] such that D := tq o Dq is 
^-concentrated over ft'Jio], in time (d + n + e ) o( - £() \ Clearly, D £ H fc (F[t ])[x] and 
C := tqoC = c T -D. For X C [n] of size at most £q, define ax '■ %j h-> (xj if j £ X, else 0) 
for all j £ [n]. Clearly, ax°C is only £o- v ariate, thus it has sparsity (d H +£o)°^°\ By the 
assumption on D we know that there exists such an X for which ax ° C" ^ 0. Thus, using 
standard sparse PIT methods (see [BHLV09;) we can construct a hitting-set for C", in 
time (d H + n + e )°^ = 2°(^Hio g ( s +4)) _ exp (O(£ H 2 log a)), which is time polynomial 
in exp((2i? 2 logs)^ +1 ). □ 

5.2. Proof of Corollary pj Suppose we are given a blackbox access to a semi-diagonal 
formula C = Y2i=i m * ' 11^=1 h f over F, where rm is a monomial, fij is a sum of 
univariate polynomials, and b is a constant. Call its size s. 

Assume p := char(F) is zero (or larger than max^-je^}). Using the duality trick (see 

[SSS121 Theorem 2.1]), there exists another representation of C as C := Yli=i YYj=i 9i,j( x j^ 



of size s°( b \ Rewrite this, using the obvious Hadamard algebra H^'(F), as - C = c T ■ D, 
where D = Gi(x\) * ■ ■ ■ * G n (x n ) £ Hfc'(F)[x]. Trivially, the monomial- weight of each Gj 
is bounded by 1. Thus, by invoking Theorem [9] (& the 'key argument' in Lemma 16) we 



can shift D, in time s°( logk '\ such that it becomes 0(log fc')-concentrated. On top of the 
shift, the usual sparse PIT gives a hitting-set for C in time s°^ ogs \ □ 

5.3. Proof of Corollary [3j Suppose we are given a blackbox access to the formula 
C = Y2i=i TYj=i fij( x Xj ;) 6i,J i where foj is a sum of univariate polynomials in Ffa;^], 
eij £ N, and X\ U • • • U X& partitions [n]. Let the formula size be s. 

Assume char(F) is zero (or larger than maxjj{ejj}). Using the duality trick (see 
[SSS12J Theorem 2.1]), there exists another representation of fi ) j{xx j ) ei ' 3 as Fij := 

Ylp=i IlgeXj 9i,j,p,i( x <i) °f s i ze s °^- Trivially, the monomial-weight of each gij tPl q is 
bounded by 1. Overall, we can represent C now as C' := Ya=i Ilj=i Fiji which is a 
set-depth-6 formula. Recall the inductive proof of Theorem |15| on C' . It will have H = 3 
inductive steps. The crucial observation is that in the base case (dealing with sparse 



polynomials) we can use a better bound £' = 2 in Lemma 16 as p(gij,p,q) ^ 1- This leads 
us to an improvement on Theorem 15 - we construct tq such that tq o Dq is 0(log 2 s)- 



concentrated over 7^[to], in time polynomial in s g s . Again, on top of the shift, the 
usual sparse PIT gives a hitting-set for C in time s°( log s \ □ 

6. Conclusion 

We have identified a natural phenomena - low-support rank-concentration - in constant- 
depth formulas, that is directly useful in their blackbox PIT (up to quasi-polynomial time). 
In this work we gave a proof for the interesting special case of set-depth-A formulas. 
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More work is needed to prove such rank-concentration in full generality. Next, it would 
be interesting to prove rank-concentration for depth-3 formulas. Another direction is to 
improve this proof technique to give polynomial-time hitting-sets for set-depth-A formulas. 
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Appendix A. Diagonal circuits: The spirit of the argument 

A circuit C = X^=i ft ls a diagonal circuit if fi is a linear polynomial in n variables, 
x. []] We can associate a formula over a Hadamard algebra with C, namely 

D(x) := F d overH fc (F), 

where F = z$ + z\X\ + . . . + z n x n , every Zj S ¥ k and F restricted to the i-th coordinate of 
the vectors zq, . . . , z n is the linear polynomial fi. Clearly, C = (1, 1, . . . , 1) • D(x), where 
• is the usual matrix product. Assume that char(F) = or > d. 

Consider shifting every Xj by a formal variable tj, i.e. Xj \— )■ Xj + tj. Then, 

D(x + t) = F(x + t) d = D(t) * (1 + z[ Xl + ... + z' n x n ) d =: D(t) * D'(x), 

where z'j = D(t)~ 1 Zj. We have stated before (in Section [l| that variables would be 
ultimately shifted by field constants. Here is a way to set tj a field constant: To ensure 
that Z?(t) -1 makes sense when tj's are set to constants, we map tj t-t yi where y is a fresh 
variable and then set y to an a £ F such that a is not a root of any of the polynomials 
fi(y,y 2 ,...,y n ), 1 < i < k. With this setting, we can safely assume that D(t) and 

4,...,<eH fe (F). 

Clearly, C(x + t) = (1, 1, . . . , 1) ■ D(x + 1) is zero if and only if C = 0. We would like to 
show that for I = [log A;] , C(x + t) is ^-concentrated over F. The coefficient of a monomial 
* e = n ieW m D(x + t ) is D{t) * Coef (e)(D') = QD(t) * U M n] Z P = Q D (*) * 
where Q = ( d ; ). For a moment, treat z' e as a 'monomial' in z\, . . . ,z n . List down 
all monomials in zi,...,z n with degree bounded by d in degree-lexicographic order. The 
idea is to form a basis of sp F {Coef (e)(D') \ e G N™} by picking terms z' e , the coefficient 
of x e in D' (upto scaling by ( e )), from the ordered list. We pick a term z^ . . . z e ™^ 
(ej > 0) from the ordered list if it is not in the span of the already picked terms. The 

■*"A lemma by Ellison |E1169| states that every n-variate polynomial of degree d over C has a diagonal 
circuit representation although k can be exponentially large. 
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claim is, if zj* . . . z?™ (ej > 0) is picked then so are the terms IlreS z ir > f° r ever y se t 
S C [m] - this follows easily from the degree- lexicographic ordering of the list. This 
implies that m < \logk] = £, as dimension of sp F {Coef(e)(L>') | e G N n } is bounded 
by k and there are 2 m such terms YlreS z ir' Therefore, D'(x) is ^-concentrated over 
Hfc(F) which implies that D{x + t) = D(t) * D'{x) is ^-concentrated over H^(F). Since, 
C(x + t) = (1, 1, . . . , 1) • D(x + t), C(x + t) is also ^-concentrated over F. 

Thus, by shifting xj h-> CEj + a- 7 , where a G F is such that none of the fi(a, a 2 , . . . , a n ) 
is zero, we are guaranteed that the shifted diagonal circuit satisfies [log £;] -concentration. 
Such an a is always present among a set of kn + 1 distinct elements of F. A quasi- 
polynomial hitting set generator for C(x) ensues immediately (as sketched in Section [T]). 

Appendix B. Missing proofs of Section [2] 
B.l. Proof of Lemma 0. 

Proof. Recall that fj(x Xj ) = (fij(x Xj ), ■ ■ . , fk,j(x Xj )) T , where every fij(xx 3 ) is a set- 
height- (H — h — 1) formula over TZh- The proof is by induction on height (H — h — 1) of 
fj(xx J ) (in other words, reverse induction on h). 

Base case (h + 1 ^ H — 1): The base case is when H — h — 1 = 1 or 0, i.e. /ij(£Cx,)' s are 
sparse polynomials or linear polynomials depending on whether A is even or odd, repec- 
tively. In this case, fj{xx j ) is a set-height-(ii— h— 1) formula over TZh+i- Also, the sparsity 
parameter A remains the same by its definition. Hence, fj(xx j ) G Ch+\{k, d, A, x Xj )- (Here 
we do not care about the partition.) 

Inductive step (h + 2 to h + 1): The crucial property to note here is that the formulas 
/j,j(^Xj )'s appear as sub-formulas of Ch at depth-3 (Equation [Tj) . Therefore, the cor- 
responding n-layers of fij {xxj ),■■■, fk,j( x Xj) respect the same partitions of xx y In 
particular, we can express every fij(xx 3 ) as, 

k d 
p=l q=l 

where G TZh, gi,j, P ,q(xY ) is a set-height- (H — h — 2) formula over TZh, an d the first 
n-layer of all fij(x Xj ), for 1 < i < k, respect the same partition 7-^(2, Xj). In other 
words, Yj >q 's partition Xj as do X2 ;Q D Xj. (Note: With j fixed, here X^^ q n Xj are the 
only relevant variable indices.) Hence, 

k d 

(10) fj (PXj ) = Yl b 3>P " II Sj,P,1 ( X Y M ) , 

where 6 J)P = • • • , &fc,j, P ) T G TZ h+1 an d 9j,p,q{x Y ^ q ) = (9l,j,p,q(xY jjq ), 9k,j, P ,q(x Yj>q )) T 

G 7th+i[ajy M j. 

In order to apply induction, we make a comparison between fi^{xx j ) and 9i,j,p,q{xYj q ) 
(and between fj(xx j ) and gj^^ixy^ q )). Just like fij{xxj) is a set-height- (if — /i — 1) 
formula over 7?./,, occurring as a sub- formula at depth-3 of the formula Ch, gi,j,p,q{xYj q ) is a 
set-height- (if — ft,— 2) formula over IZh occurring as a sub- formula at depth-5 of the formula 
Ch- Hence, by induction, gj, P ,q{xYj q ) is a set-height- (if — h — 2) formula in TZh+i [xy. ] 
with S-fanin /c, n-fanin d and sparsity parameter A i.e., 9j,p,q{xy j q ) G Ch+2{k, d, A, x^. ), 
such that every /i'-th n-layer of 9j,p,q(xy j q ) respects the partition Vh(h' + 2, lj l9 ). Since 
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9j,P,qi x Y ) has only variables xy. and Yj >q C Xj, we can also say that every h'-th II- layer 
of 9j,p,q{xYj q ) respects the partition Vh(h' + 2, Xj). The h'-th Il-layers of the gj tP ^ q (xy j Vs 
(for 1 < q < d) correspond to the (h! + l)-th II-layer of fj(xx i )- Hence, by Equation 



10 



'3 

we infer that every /i'-th II-layer of fj{xx j ) respects the partition Vh{h' + l,Xj). Note 
that the S-fanin, II-fanin and the sparsity parameter remain k, d and A, respectively. This 
proves the claim. □ 

Appendix C. Missing proofs of Section [3] 
C.l. Proof of Lemma 1101 

Proof. Consider a column u E S of Z'; it is z' u . Now 

4 = f(t)~ l *J2 Z "( V ) tV ~ U [by E quation|4) 

= f{ty l * Z ■ (u-th column of N S TN S x ). 
Running over all n € S gives us the result. □ 

C.2. Proof of Lemma II li 



Proof. Lemma 10 gives Z' s = f(t) 1 * ZN$Ts t sN s 1 . Rewrite it as, 

f(t)- 1 *Z = Z' s N s T'N s 1 . 

Going modulo the subspace sp F ( t ){2:Q} kills the 0-th column of Z' s and yields, 

f(t)- l *Z^Z' s »N s ,T' s ^ s N s l (mod 4). 

For the second part we exploit the independence of T' s * s from Z and the Hadamard 
algebra. Formally, fix a large enough k, say |<S|, and the Hadamard algebra H^(F). Let 
e G S. Fix Z as: Its e-th column is and the rest are linearly independent modulo 1 (note: 
1 = z'q). For this 'generic' setting we still have the equation, /(i) -1 *Z = Z' s *Ns*Tg* gN^ 1 
(mod z' ). Implying, 

f(t)- 1 -k Z S \ {e } = Z's* N s*Ts* >s \{ e }Ns\ {e} (mod %). 

Since the LHS is a matrix of rank |5| — 1, we deduce that T' s * §\i e \ is invertible. In other 
words, T' s * s is strongly full. □ 

C.3. Proof of Lemma 1121 



Proof. For i E [I], we can apply Lemma 11 to and get, 

(11) f^-^Zi^ZiNs^Ng 1 (modi) 

where the 1 is the unity, the all one vector, in H K (F). Denote the tij-th column of the 
matrix on the RHS, of the above congruence, by Cj Ui . 
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Consider a column u G S of Z; it is z u . Now 

ie[<] 

= j^J (a, + Cj )Ui ) [for some a, G F(t) by Equation 11 



ie[<] 

Cj jUj (mod V^(-D')) ['.• the product of £ or less Cj iUi vanishes] 



Running over all u G 5 gives us 



= {®im Z 'i)-®im( N St T l N S-) [by Lemma 
= Z' • A/5/ • T' • A^ 1 (modV^(D')) [by Lemma §(1)] 

□ 

C.4. Proof of Theorem SHI 

Proof. We know that T" = ®i e m2| , where each T[ G (5* x S\ — > F) is strongly full (Lemma 



lllfor fi). Thus, we can apply invertible row operations Ei G (S* xS* — > F) such that E{r[ 
has a 1 5* [-sized identity submatrix, and another column that has only nonzero entries. 

Since, from now on, we are not going to use the properties of the index sets S*,Si, 
we replace them by a more readable identification: Define, for i G [£], rii := |<S*| > 
and identify S* (resp. Si) with Ui := [ni\ (resp. W» := [0..nj]). Let U := XfeunUi and 
^ := ^ieW^i- Wlog we keep the following setting: For all i G [•£], 

(1) {T' i )u i) u i = I ni [by Lemma[8|(l), and taking to be our new T-], and 

(2) the column (T!)u u q is zero free. 

Define an indicator function (note: £(•) equals 1, if the boolean condition is true, else 0) 
e : N >0 x N -> {0, 1}; (u,w)^i((w = 0)V(ro/0Aw = u)) . 

Extend it to N> x N e by defining e : (u, w) h-> IlreM £(u T ,w r ). 

Note that the (n,w)-th entry in T' is nonzero iff s(u, w) = 1. Thus, e exactly indicates 
the non-zeroness in TL 

Similarly, by tensoring, the (u, w)-th entry in T' G (U x W — > F) is nonzero iff e(u, = 
1. Thus, e exactly indicates the non-zeroness in T' . 

We will build C incrementally, starting with C = 0. During this build up we might apply 
row permutations R on T'. 

Consider a column u, u G U C W, of T' . This column has exactly one nonzero entry; 
appearing at the row indexed by u G U. Put all these unmarked columns u in C, and 
collect the marked ones in A4\. 

If M\ = then we already have |C| = |?7| and we are done (infact, T^ c is identity). So 
assume \Adi\ ='■ mi G [k] and define ni2 '■= k — mi < k. Let the other marked columns 
be A^2 '■= M \ Mi; they lie in W \ U and are mi many. 

Consider the unmarked columns in W\U; collect them in C := W\ (U DM2)- We will 
now focus on the submatrix T' Mi =: T[. Note that its column-indices are ^-tuples 
with at least one zero. 
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Claim 18. There exists a row-permutation Ri £ F miXmi , and m\ unmarked columns 
C\ C C such that: (RiT[)_\4 lt c 1 is a lower-triangular m\ x m\ matrix with w-th (w £ C\) 
diagonal entry being nonzero. 



Proof of Claim 18. We will again build C\ incrementally, starting from 0. 

Recall that each row of T{ is indexed by an £-tuple u in U. For i £ [£] we denote the 
i-th coordinate in u by u(i), and for an / C [£], u(I) denotes the ordered set {u(i)\i £ I}. 
For w £ W, define the support S(w) := {i € [£] \ w(i) ^ 0}. We want to permute the 
rows so that the coordinates of the row-indices appear in a decreasing order of frequency. 
Formally, pick R\ £ ]p m i xm i to reorder the rows of T{ as Aii = (ui, . . . , u mi ) such that: 

• The ordered list tii(l), . . . , u mi (1) has repetitions only in contiguous locations and 
the frequencies are non-increasing. In equation terms: The list has some r distinct 
elements a\, . . . ,a r £ U\ with respective frequencies %\ ^ • ■ • ^ i r (summing to 
mi), and they appear as a\{i\ times), . . . , a r (i r times). 

• The ordered list «i(2)), . . . , (u mi (l), u mi (2)) has repetitions only in con- 
tiguous locations and the frequencies are non-increasing. 

• The same as above holds for 3-tuples, 4-tuples,. . .,£-tuples. 

We now describe an iterative process to build C\ one element at a time. In the i-th 
iteration, i £ [mi], we will add an unmarked, unpicked column Wi £ C to C\. The process 
maintains the invariant: (R\T[)_\4 lt c 1 is a lower-triangular matrix. 

Iteration i = 1 - The row u\ of T[ has exactly 2^ — 1 nonzero columns. (Why? Zero-out at 
least one coordinate of u%.) Since 2 l — 1 ^ k > |7W 2 1 we can pick a column w\ £ C such 
that e(ui,wi) / 0, thus (T[) UltWl ^ 0. Add w\ to C%. 

Iteration i ^ 2 - Consider the list u\ , . . . , Ui . We claim that there are positions I C [£] , 
|/| ^ fig*! j suc h that Ui{I) is not contained in any of the previous sets in the list. The 
proof is by binary-search in the list. Start with 1 = 0. Pick the least ji £ [I] such that 
ui(ji), . . . ,Ui(ji) are not all the same; add j\ to /. By the ordering on -u's the frequency 
li\ of Ui(ji) is at most i/2. If it is one then we stop with this /, otherwise we zoom-in 
on the 'halved' list Uj_ Ml+ i, . . . , Ui. Again we pick the least j'2 £ [j\ + 1,£] such that 
""■•i— /.ii+i (j'2) , • • • ,Ui(j2) are not all the same; add j'2 to /. This leads to a further halving 
of the list, and so on. Finally, we do have our positions /, |J| ^ fig «~| , such that Ui{I) 
appears for the first time in Uj. 

We deduce that each column w of T[, with / C S(w) C [£] and w(S(w)) = Ui(S(w)), has 
the first nonzero entry at the Uj-th row. (Why? Consider e(uj, w) = e(uj(S(w)), w(S(w))) = 
e(uj(S(w)), Ui(S(w))).) The number of such columns w, that are unmarked and unpicked, 
is at least (2^l / l-l)-m 2 -(i-l) ^ 2 e ~W-K ^ 2 i ~^-K > 2?-fl*"\ -k = 2^ k ^ +1 -k > 0. 
So we can pick such a column, say, w% £ C \ C\ and add to C\ . 

Note that the square submatrix of T[ thus far, (RiT[)^ ult „ mjUi \fi 1 is lower-triangular with 
a nonzero diagonal. 

After the iteration i = mi - The square matrix {R\T[)j V [ 1 fi 1 is lower-triangular with a 
nonzero diagonal. 

This finishes the claim. □ 

Since R\ permutes the rows of T[, its action can be lifted to the rows of T'\ call this 
action R. Also, append C\ to the current C (making its size \U\). Define Ai± := U \ Ai\ 
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and C\ := C \ C\. Consider the square matrix {RT')uc- It looks like, 









I M 1 ,C 1 


( RT ')m u Ci 




(RT') MuCl 




. °Mi,Ci 


(RiT[) MltCl 



Clearly, its determinant equals \{R\T[)j v i 1 fi 1 \ 7^ 0. Thus, \Ty C \ 7^ and we are done. □ 

C. 5. Proof of Lemma 1141 

Proof. Let a be the v-th column of A. Let a' G F^l be the vector having the entries of a 
appearing at the rows Ai. Consider (T'N^ 1 ) ■ a. By the property of a we can write, 

(T'N^a = (T'N^s^v + iT'N^s^M-a' 
= T^^.t-v + iT'N^s^M-a'. 

Thus, the f-th column of A has the leading monomial t~ v which 'contributes' the vector 
T' s , . Going over the columns a, running v G C, by the column-linearity of determinant 
and the multiplicativity of the inverse-monomial ordering, we deduce that the largest 
possible (inverse-monomial) term in the expression IT'N^AI is: 

\T' slfi \-t-^ v - 

We know this is nonzero, by the property of C, thus it is indeed the leading term. In 
particular, \T'N^A\ / 0. □ 

Appendix D. Missing proofs of Section [4] 

D. l. Proof of Lemma 1161 

Proof. If 2 [~log 2 (K • s(/))] ^ fJ-(f) then £' = 1 + /u(/). In this case trivially, for any shift 
a, o-(f) is ^'-concentrated over H K (F(i)). So, from now on we assume 2 [log 2 (K • s(/))] 
<fi(J), thus £' = 1 + 2 riog 2 (K-s(/))l. 

Define 1Z := H s( -j)(H K (F)). Let / =: Yl e eS(f) z e% e - Define a column vector D G (S(/) x 
[1] —J- H K (F[;r])) with e-th entry being z e x e \ D can be seen as a polynomial over 1Z. Rewrite 
D product of univariate polynomials over 7Z as: 

D{x) = gi(xi) -k ■ ■ ■ -k g n (x n ). 

Clearly, each gi has degree, hence sparsity, bounded by 5, and can be seen as an element 
in H K . s(/) (F)[xi]. 

For any X C [n] of size £' , define Dx{x) := Yliex 9i( Xi )- Recalling Theorem [9] we 
can construct a shift a for Dx, such that c o Dx is ^'-concentrated, in time polynomial 
in (5 + n + . Using induction on the number of variables, it is easy to see that if 
a o Dx is ^'-concentrated (Wf G (}?))) then so is a o D. The A;ey argument is: Since 
the constant coefficient in each (i.e. shift- ^-normalized <?j) is one, deduce that the 
coefficient of any term in D' (i.e. shift-&-normalized D) of block-weight ^ £' is produced 
by the product of some ^ £' g-'s, so this case is covered by some X G (}?})• Also, deduce 
that the coefficient of any term in D' of block- weight > £' can be inductively written down 
as a linear combination of {Coef(e)(D') | e G N ri ,s(e) < £'}. Finally, a o D inherits this 
concentration property from D' . 

Recall / = 1 T ■ D, where 1 is the unity in 1Z = H s /«(H re (F)). Thus, from the £'- 
concentration of cro D (over 1Z), we can deduce the ^'-concentration of a o f (over H K (F)). 
This completes the construction of a. □ 
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D.2. Proof of Lemma 1171 

Proof. View / as a vector with k coordinates; each entry is in F[a;]\{0}. Call the i-th entry 
fi. Clearly, /j has variables (resp. degree) at most n (resp. 5). Also, by the concentration 
property there exists ej G N n , with s(ej) ^ £', such that Coef(ej)(/j) / 0. 

For X C [n] of size at most £' , define ax ■ Xj \— > (xj if j G X, else 0) for all j G [n]. 
Clearly, ax ° fi is only £' variate, thus it has sparsity (5 + £')°^\ By the assumption 
on fi we know that Xi := S(ej) is of size at most £' , and crx* ° /» ^ 0. Using standard 
sparse PIT methods (see [BHLV09 ), we can construct a hitting-set for crjq o /j in time 
(5 + £')°^'\ Varying over all subsets X C [n] of size at most £' , we get a hitting-set for fi 
in time ). For convenience, denote this hitting-set as a set of evaluation-maps 

{an, . . . , ai tr }; each map is from x to F and we write crjj o /j to mean fi(aij(x)). Overall 
we are ensured the existence of a j, for a given i, such that crjj o /j ^ 0. We will now show 
how to combine all these into a single map. 

Pick distinct nr elements j3i t ±, . . . , /3 K/r G F. Consider the univariate polynomial g(u) := 
IlieW je[r]( u ~ Define gij(u) := g(u)/(u — Pij), for all i, j. Consider an evaluation 

map from ¥[x] to F[u, v] - a := v ■ J2ie[ K ] je[r] 9i,j( u ) ' a i,j- We claim that, for all i G [«], 
a ° fi 7^ 0- To see this, note that there is some j G [r] for which <7j j o f% ^ 0. Further, 
let /j- be a homogeneous part of /j, say of degree Si, such that <7jj o fi ^ 0. Consider the 
partial evaluation (a a fi)(f3ij,v) = fi(v ■ gij(Pi,j) ■ aij(x)). Here the coefficient of the 
monomial v Si is gi,j{Pi,j) Sl • (cjj o /?) ^ 0. Consequently, er o fi ^ 0. 

Thus, for all i G [«], c o fa is a nonzero bivariate polynomial in ¥[u, v]. Since its degree 
remains bounded by 5 ■ nr, we can again apply [BHLV09] to replace u, v by a hitting-set. 
Finally, we hit an a G F n , in time polynomial in k(5 + n + £') , such that for all i G [«], 
fi(a) 7^ 0. This finishes the proof. □ 
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